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1 ATGGCGCAGA AGGGCCAACT CAGTGACGAT GAGAAGTTCC TCTTTGTGGA 

51 CAAAAACTTC ATCAACAGCC CAGTGGCCCA GGCTGACTGG GCCGCCAAGA 

101 GACTCGTCTG GGTCCCCTCG GAGAAGCAGG GCTTCGAGGC AGCCAGCATT 

151 AAGGAGGAGA AGGGGGATGA GGTGGTTGTG GAGCTGGTGG AGAATGGCAA 

2 01 GAAGGTCACG GTTGGGAAAG ATGACATCCA GAAGATGAAC CCACCCAAGT 
251 TCTCCAAGGT GGAGGACATG GCGGAGCTGA CGTGCCTCAA CGAAGCCTCC 

3 01 GTGCTACACA AC CTGAGGGA GCGGTACTTC TCAGGGCTAA TATATACGTA 
351 CTCTGGCCTC TTCTGCGTGG TGGTCAACCC CTATAAACAC CTGCCCATCT 

4 01 ACTCGGAGAA GATCGTCGAC ATGTACAAGG GCAAGAAGAG GCACGAGATG 
451 CCGCCTCACA TCTACGCCAT CGCAGACACG GCCTACCGGA GCATGCTTCA 
501 AGATCGGGAG GACCAGTCCA TTCTATGCAC AGGCGAGTCT GGAGCCGGGA 
551 AAACCGAAAA CACCAAGAAG GTCATTCAGT ACCTGGCCGT GGTGGCCTCC 

)8 601 TCCCACAAGG GCAAGAAAGA CACAAGTATC ACGCAAGGCC CATCTTTTGC 

fU 

S 4 65 i CTACGGAGAG CTGGAAAAGC AGCTTCTACA AGCAAACCCG ATTCTGGAGG 

Ul 

*P 701 CTTTCGGCAA CGCCAAAACA GTGAAGAACG ACAACTCCTC ACGATTCGGC 

si 

„ 751 AAATTCATCC GCATCAACTT CGACGTCACG GGTTACATCG TGGGAGCCAA 

-5, sol CAT TGAG AC C TATCTGCTAG AAAAATCACG GGCAATTCGC CAAGC CAGAG 

851 AC G AGAGG AC ATTCCACATC TTTTACTACA TGATTGCTGG AGCCAAGGAG 



lf7 -5 

'ssis? 



Q 901 AAGATGAGAA GTGACTTGCT TTTGGAGGGC TTCAACAACT ACACCTTCCT 

951 CTCCAATGGC TTTGTGCCCA TCCCAGCAGC CCAGGATGAT GAGATGTTCC 

1001 AGGAAACCGT GGAGGCCATG GCAATCATGG GTTTCAGCGA GGAGGAGCAG 

1051 CTATCCATAT TGAAGGTGGT ATCATCGGTC CTGCAGCTTG GAAATATCGT 

1101 CTTCAAGAAG GAAAGAAACA CAGACCAGGC GTCCATGCCA GATAACACAG 

1151 CTGCTCAGAA AGTTTGCCAC CTCATGGGAA TTAATGTGAC AG ATTT C AC C 

12 01 AGATCCATCC TCACTCCTCG TATCAAGGTT GGG CGAGATG TGGTACAGAA 

12 51 AGCTCAGACA AAAGAACAGG CTGACTTTGC TGTAGAGGCT TTGGCCAAGG 

13 01 CAACATATGA GCGCCTTTTC CGCTGGATAC TCACCCGCGT GAACAAAGCC 

13 51 CTGGACAAGA CCCATCGGCA AGGGGCTTCC TTCCTGGGGA TCCTGGATAT 

14 01 AGCTGGATTT GAGATCTTTG AGGTGAACTC CTTCGAGCAG CTGTGCATCA 
14 51 ACTACACCAA CGAGAAGCTG CAGCAGCTCT TCAACCACAC CATGTTCATC 
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1501 CTGGAGCAGG AGGAGTACCA GCGCGAGGGC ATCGAGTGGA ACTTCATCGA 
1551 CTTTGGGCTG GACCTACAGC CCTGCATCGA GCTCATCGAG CGACCGAACA 
1601 ACCCTCCAGG TGTGCTGGCC CTGCTGGACG AGGAATGCTG GTTCCCCAAA 
1651 GCCACGGACA AGTCTTTCGT GGAGAAGCTG TGCACGGAGC AGGGCAGCCA 
1701 CCCCAAGTTC CAGAAGCCCA AGCAGCTCAA GGACAAGACT GAGTTCTCCA 
1751 TCATCCATTA TGCTGGGAAG GTGGACTATA ATGCGAGTGC CTGGCTGACC 
1801 AAGAATATGG ACCCGCTGAA TGACAACGTG ACTTCCCTGC TCAATGCCTC 
1851 CTCCGACAAG TTTGTGGCCG AC CTGTGGAA GGACGTGGAC CGCATCGTGG 
1901 GCCTGGACCA GATGGCCAAG ATGACGGAGA GCTCGCTGCC CAGCGCCTCC 
1951 AAGACCAAGA AGGGCATGTT CCGCACAGTG GGGCAGCTGT ACAAGGAGCA 
2 001 GCTGGGCAAG CTGATGACCA CGCTACGCAA CACCACGCCC AACTTCGTGC 
2 051 GCTGCATCAT CCCCAACCAC GAGAAGAGGT CCGGCAAGCT GGATGCGTTC 
2101 CTGGTGCTGG AGCAGCTGCG GTGCAATGGG GTGCTGGAAG GCATTCGCAT 
2151 CTGCCGGCAG GGCTTCCCCA ACCGGATCGT CTTCCAGGAG TTCCGCCAAC 
2201 GCTACGAGAT CCTGGCGGCG AATGCCATCC CCAAAGGCTT CATGGACGGG 
2251 AAGCAGGCCT GCATTCTCAT GATCAAAGCC CTGGAACTTG ACCCCAACTT 
2301 ATACAGGATA GGGCAGAGCA AAATCTTCTT CCGAACTGGC GTCCTGGCCC 
2351 AC CTAGAGGA GGAGCGAGAT TTGAAGATCA C CGATGTC AT CATGGCCTTC 
24 01 CAGGCGATGT GTCGTGGCTA CTTGGCCAGA AAGGCTTTTG CCAAGAGGCA 
2451 GCAGCAGCTG ACCGCCATGA AGGTGATTCA GAGGAACTGC GCCGCCTACC 
2501 TCAAGCTGCG GAACTGGCAG TGGTGGAGGC TTTTCACCAA AGTGAAGCCA 
2551 CTGCTGCAGG TGACACGGCA GGAGGAGGAG ATGCAGGCCA AGGAGGATGA 
2 601 ACTGCAGAAG ACCAAGGAGC GGCAGCAGAA GGCAGAGAAT GAGCTTAAGG 
2 651 AGCTGGAACA GAAGCACTCG CAGCTGACCG AGGAGAAGAA CCTGCTACAG 
2 701 GAACAGCTGC AGGCAGAGAC AGAGCTGTAT GCAGAGGCTG AGGAGATGCG 
2 751 GGTGCGGCTG GCGGCCAAGA AGCAGGAGCT GGAGGAGATA CTGCATGAGA 
2 801 TGGAGGCCCG CCTGGAGGAG GAGGAAGACA GGGGCCAGCA GCTACAGGCT 
2851 GAAAGGAAGA AGATGGCCCA GCAGATGCTG GACCTTGAAG AACAGCTGGA 
2901 GGAGGAGGAA GCTGCCAGGC AGAAGCTGCA ACTTGAGAAG GTCACGGCTG 
2951 AGGC C AAGAT CAAGAAACTG GAGGATGAGA TCCTGGTCAT GGATGATCAG 
3001 AACAATAAAC TATCAAAAGA ACGAAAACTC CTTGAGGAGA GGATTAGTGA 
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3 051 CTTAACGACA AATCTTGCAG AAGAGGAAGA AAAGGCCAAG AATCTTACCA 

3101 AGCTGAAAAA CAAGCATGAA TCTATGATTT CAGAACTGGA AGTGCGGCTA 

3151 AAGAAGGAAG AGAAGAGCCG ACAGGAGCTG GAGAAGCTGA AACGGAAGCT 

3201 GGAGGGTGAT GCCAGCGACT TCCACGAGCA GATCGCTGAC CTCCAGGCGC 

3251 AGATCGCAGA GCTCAAGATG CAGCTGGCCA AGAAGGAGGA GGAGCTGCAG 

3301 GCGGCCCTGG CCAGGCTTGA CGATGAAATC GCTCAGAAGA ACAATGCCCT 

3351 GAAGAAGATC CGGGAGCTGG AGGGCCACAT CTCAGACCTC CAGGAGGACC 

34 01 TGGACTCAGA GCGGGCCGCC AGGAACAAGG CTGAAAAGCA GAAGCGAGAC 

34 51 CTCGGCGAGG AGCTGGAGGC CCTAAAGACA GAGCTGGAAG ACACACTGGA 

3 501 CAGCACAGCC ACTCAGCAGG AGCTCAGGGC CAAGAGGGAG CAGGAGGTGA 

3551 CGGTGCTGAA GAAGGCCCTG GATGAAGAGA CGCGGTCCCA TGAGGCTCAG 

g 3601 GTCCAGGAGA TGAGGCAGAA ACACGCACAG GCGGTGGAGG AGCTCACAGA 

f 3651 GCAGCTTGAG CAGTTCAAGA GGGCCAAGGC GAACCTAGAC AAGAATAAGC 

W 37 01 AGACGCTGGA GAAAGAGAAC GCAGACCTGG CCGGGGAGCT GCGGGTCCTG 

M 

Ul 37 51 GGCCAGGCCA AGCAGGAGGT GGAACATAAG AAGAAGAAGC TGGAGGCGCA 

m 

O 3 951 GGCGTCCCTC AGTTCCCAGC TCCAGGACAC CCAGGAGCTG CTTCAAGAAG 

Q 

U> 4001 AAACCCGGCA GAAGCTCAAC GTGTCTACGA AGCTGCGCCA GCTGGAGGAG 

4 051 GAGCGGAACA GCCTGCAAGA CCAGCTGGAC GAGGAGATGG AGGCCAAGCA 

4101 GAACCTGGAG CGCCACATCT CCACTCTCAA CATCCAGCTC TCCGACTCGA 

4151 AGAAGAAGCT GCAGGACTTT GCCAGCACCG TGGAAGCTCT GGAAGAGGGG 

42 01 AAGAAGAGGT TCCAGAAGGA GATCGAGAAC CTCACCCAGC AGTACGAGGA 
4251 GAAGGCGGCC GCTTATGATA AACTGGAAAA GACCAAGAAC AGGCTTCAGC 

43 01 AGGAGCTGGA CGACCTGGTT GTTGATTTGG AC AAC CAGCG GCAACTCGTG 

43 51 TCCAACCTGG AAAAGAAGCA GAGGAAATTT GATCAGTTGT TAGC CGAGGA 

44 01 G AAAAAC AT C TCTTCCAAAT ACGCGGATGA GAGGGACAGA GCTGAGGCAG 
44 51 AAGCCAGGGA GAAGGAAACC AAGGCCCTGT CCCTGGCTCG GGCCCTTGAA 
4 5 01 GAGGCCTTGG AAGCCAAAGA GGAACTCGAG CGGACCAACA AAATGCTCAA 
4 551 AGCCGAAATG GAAGACCTGG TCAGCTCCAA GGATGACGTG GGCAAGAACG 
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3 8 01 GGTGCAGGAG CTGCAGTCCA AGTGCAGCGA TGGGGAGCGG GCCCGGGCGG 
3 851 AGCTCAATGA CAAAGTCCAC AAGCTGCAGA ATGAAGTTGA GAGCGTCACA 
3 901 GGGATGCTTA ACGAGGCCGA GGGGAAGGCC ATTAAGCTGG CCAAGGACGT 
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1551 MKTQLEELED ELQATEDAKL 

1601 LQRQLHEYET ELEDERKQRA 

1651 IKQLRKLQAQ MKDFQRELED 

17 01 EDLAAAERAR KQADLEKEEL 

17 51 ELEEEQGNME AMSDRVRKAT 

1801 NKELRSKLHE MEGAVKSKFK 

1851 LKQKDKKLKE ILLQVEDERK 

1901 QRINANRRKL QRELDEATES 



RLEVNMQALK GQFERDLQAR DEQNEEKRRQ 
LAAAAKKKLE GDLKDLELQA DSAIKGREEA 
ARASRDEIFA TAKENEKKAK SLEADLMQLQ 
AEELAS SLSG RNALQDEKRR LEARIAQLEE 
QQAEQLSNEL ATERSTAQKN ESARQQLERQ 
STIAALEAKI AQLEEQVEQE AREKQAATKS 
MAEQYKEQAE KGNARVKQLK RQLEEAEEES 
NEAMGREVNA LKSKLRGPPP QETSQ 
FIG. 2B 



ru 



1 ATGGCGCAGA AGGGCCAACT CAGTGACGAT GAGAAGTTCC TCTTTGTGGA 

51 CAAAAACTTC ATCAACAGCC CAGTGGCCCA GGCTGACTGG GCCGCCAAGA 

101 GACTCGTCTG GGTCCCCTCG GAGAAGCAGG GCTTCGAGGC AGCCAGCATT 

151 AAGGAGGAGA AGGGGGATGA GGTGGTTGTG GAGCTGGTGG AGAATGGCAA 

2 01 GAAGGTCACG GTTGGGAAAG ATGACATCCA GAAGATGAAC CCACCCAAGT 
251 TCTCCAAGGT GGAGGACATG GCGGAGCTGA CGTGCCTCAA CGAAGCCTCC 

3 01 GTGCTACACA ACCTGAGGGA GCGGTACTTC TCAGGGCTAA TATATACGTA 
351 CTCTGGCCTC TTCTGCGTGG TGGTCAACCC CTATAAACAC CTGCCCATCT 
401 ACTCGGAGAA GATCGTCGAC ATGTACAAGG GCAAGAAGAG GCACGAGATG 
451 CCGCCTCACA TCTACGCCAT CGCAGACACG GCCTACCGGA GCATGCTTCA 
501 AGATCGGGAG GAC C AGTCC A TTCTATGCAC AGGCGAGTCT GGAGCCGGGA 
551 AAACCGAAAA C AC C AAGAAG GTCATTCAGT ACCTGGCCGT GGTGGCCTCC 
601 TCCCACAAGG GCAAGAAAGA CACAAGTATC ACGCAAGGCC CATCTTTTGC 
651 CTACGGAGAG CTGGAAAAGC AGCTTCTACA AGCAAACCCG ATTCTGGAGG 
701 CTTTCGGCAA CGCCAAAACA GTGAAGAACG ACAACTCCTC ACGATTCGGC 
751 AAATTCATCC GCATCAACTT CGACGTCACG GGTTACATCG TGGGAGC CAA 
8 01 CATTGAGACC TATCTGCTAG AAAAATCACG GGCAATTCGC CAAGC CAGAG 
851 ACGAGAGGAC ATTCCACATC TTTTACTACA TGATTGCTGG AGCCAAGGAG 
901 AAGATGAGAA GTGACTTGCT TTTGGAGGGC TTCAACAACT ACACCTTCCT 
951 CTC CAATGGC TTTGTGCCCA TCCCAGCAGC CCAGGATGAT GAGATGTTCC 

10 01 AGGAAACCGT GGAGGCCATG GCAATCATGG GTTTCAGCGA GGAGGAGCAG 
1051 CTATC CATAT TGAAGGTGGT ATCATCGGTC CTGCAGCTTG GAAAT AT CGT 
1101 CTTCAAGAAG GAAAGAAACA CAGACCAGGC GTCCATGCCA GATAACACAG 
1151 CTGCTCAGAA AGTTTGCCAC CTCATGGGAA TTAATGTGAC AGATTTCACC 
12 01 AGATCCATCC TCACTCCTCG TATCAAGGTT GGGCGAGATG TGGTACAGAA 

12 51 AGCTCAGACA AAAGAACAGG CTGACTTTGC TGTAGAGGCT TTGGCCAAGG 

13 01 CAACATATGA GCGCCTTTTC CGCTGGATAC TCACCCGCGT GAACAAAGCC 
1351 CTGGACAAGA CCCATCGGCA AGGGGCTTCC TTCCTGGGGA TC CTGGAT AT 

14 01 AGCTGGATTT GAGATCTTTG AGGTGAACTC CTTCGAGCAG CTGTGCATCA 

14 51 ACTACACCAA CGAGAAGCTG CAGCAGCTCT TCAACCACAC C ATGTTC AT C 

15 01 CTGGAGCAGG AGGAGTACCA GCGCGAGGGC ATCGAGTGGA ACTTCATCGA 
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15 51 CTTTGGGCTG GACCTACAGC CCTGCATCGA GCTCATCGAG CGACCGAACA 

16 01 ACCCTCCAGG TGTGCTGGCC CTGCTGGACG AGGAATGCTG GTTCCCCAAA 
1651 GCCACGGACA AGTCTTTCGT GGAGAAGCTG TGCACGGAGC AGGGCAGCCA 

17 01 CCCCAAGTTC CAGAAGCCCA AGCAGCTCAA GGACAAGACT GAGTTCTCCA 

17 51 TCATCCATTA TGCTGGGAAG GTGGACTATA ATGCGAGTGC CTGGCTGACC 

18 01 AAGAATATGG ACCCGCTGAA TGACAACGTG ACTTCCCTGC TCAATGCCTC 
1851 CTCCGACAAG TTTGTGGCCG AC CTGTGGAA GGACGTGGAC CGCATCGTGG 
1901 GCCTGGACCA GATGGCCAAG ATGACGGAGA GCTCGCTGCC CAGCGCCTCC 
1951 AAGACCAAGA AGGGCATGTT CCGCACAGTG GGGCAGCTGT ACAAGGAGCA 
2 001 GCTGGGCAAG CTGATGACCA CGCTACGCAA CACCACGCCC AACTTCGTGC 
2 051 GCTGCATCAT CCCCAACCAC GAGAAGAGGT CCGGCAAGCT GGATGCGTTC 
2101 CTGGTGCTGG AGCAGCTGCG GTGCAATGGG GTGCTGGAAG GCATTCGCAT 
2151 CTGCCGGCAG GGCTTCCCCA ACCGGATCGT CTTCCAGGAG TTCCGCCAAC 

W 22 01 GCTACGAGAT CCTGGCGGCG AATGCCATCC CCAAAGGCTT CATGGACGGG 

U1 22 51 AAGCAGGCCT GCATTCTCAT GATCAAAGCC CTGGAACTTG ACCCCAACTT 

2 3 01 ATACAGGATA GGGCAGAGCA AAATCTTCTT CCGAACTGGC GTCCTGGCCC 

2 351 AC CTAGAGGA GGAGCGAGAT TTGAAG AT C A CCGATGTCAT CATGGCCTTC 



M 

Q 24 01 CAGGCGATGT GTCGTGGCTA CTTGGCCAGA AAGGCTTTTG CCAAGAGGCA 

C3 24 51 GCAGCAGCTG ACCGCCATGA AGGTGATTCA GAGGAACTGC GCCGCCTACC 

o 

M> 2 501 TCAAGCTGCG GAACTGGCAG TGGTGGAGGC TTTTCACCAA AGTGAAGC C A 

2 551 CTGCTGCAGG TGACACGGCA GGAGGAGGAG ATGCAGGCCA AGGAGGATGA 

2 601 ACTGCAGAAG ACCAAGGAGC GGCAGCAGAA GGCAGAGAAT GAGCTTAAGG 

2 651 AGCTGGAACA GAAGCACTCG CAGCTGACCG AGGAGAAGAA CCTGCTACAG 

2 7 01 GAACAGCTGC AGGCAGAGAC AGAGCTGTAT GCAGAGGCTG AGGAGATGCG 

2 751 GGTGCGGCTG GCGGCCAAGA AGCAGGAGCT GGAGGAGATA CTGCATGAGA 

2 801 TGGAGGCCCG CCTGGAGGAG GAGGAAGACA GGGGCCAGCA GCTACAGGCT 

2 851 GAAAGGAAGA AGATGGCCCA GCAGATGCTG GACCTTGAAG AACAGCTGGA 

2 901 GGAGGAGGAA GCTGCCAGGC AGAAGCTGCA AC TTG AGAAG GTCACGGCTG 

2 951 AGGCCAAGAT CAAGAAACTG GAGGATGAGA TC CTGGTC AT GGATGATCAG 

3 0 01 AACAATAAAC TATCAAAAGA ACGAAAACTC CTTGAGGAGA GGATTAGTGA 
3 051 CTTAACGACA AATCTTGCAG AAGAGGAAGA AAAGGCCAAG AATCTTACCA 
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4651 ATGAAGACGC AGCTGGAAGA GCTGGAGGAC GAGCTGCAAG CCACGGAGGA 

4 701 CGCCAAACTG CGGCTGGAAG TCAACATGCA GGCGCTCAAG GGCCAGTTCG 

4 751 AAAGGGATCT CCAAGCCCGG GACGAGCAGA ATGAGGAGAA GAGGAGGCAA 

4801 CTGCAGAGAC AGCTTCACGA GTATGAGACG GAACTGGAAG ACGAGCGAAA 

4 851 GCAACGTGCC CTGGCAGCTG CAGCAAAGAA GAAGCTGGAA GGGGACCTGA 

4 901 AAG AC CTGGA GCTTCAGGCC GACTCTGCCA TCAAGGGGAG GGAGGAAGCC 

4 951 ATCAAGCAGC TACGCAAACT GCAGGCTCAG ATGAAGGACT TTCAAAGAGA 

5001 GCTGGAAGAT GCCCGTGCCT CCAGAGATGA GATCTTTGCC ACAGCCAAAG 

5051 AGAATGAGAA GAAAGCCAAG AGCTTGGAAG CAGACCTCAT GCAGCTACAA 

5101 GAGGACCTCG CCGCCGCTGA GAGGGCTCGC AAACAAGCGG ACCTCGAGAA 

5151 GGAGGAACTG GCAGAGGAGC TGGCCAGTAG CCTGTCGGGA AGGAACGCAC 

52 01 TCCAGGACGA GAAGCGCCGC CTGGAGGCCC GGATCGCCCA GCTGGAGGAG 

52 51 GAGCTGGAGG AGGAGCAGGG CAACATGGAG GCCATGAGCG ACCGGGTCCG 

53 01 CAAAGCCACA CAGCAGGCCG AGCAGCTCAG CAACGAGCTG GCCACAGAGC 

53 51 GCAGCACGGC CCAGAAGAAT GAGAGTGCCC GGCAGCAGCT CGAGCGGCAG 

54 01 AACAAGGAGC TCCGGAGCAA GCTCCACGAG ATGGAGGGGG CCGTCAAGTC 
54 51 CAAGTTCAAG TCCACCATCG CGGCGCTGGA GGCCAAGATT GCACAGCTGG 
5501 AGGAGCAGGT CGAGCAGGAG GCCAGAGAGA AACAGGCGGC CACCAAGTCG 
5551 CTGAAGCAGA AAGACAAGAA GCTGAAGGAA ATCTTGCTGC AGGTGGAGGA 
5601 CGAGCGCAAG ATGGCCGAGC AGTACAAGGA GCAGGCAGAG AAAGGCAATG 
5651 CCAGGGTCAA GCAGCTCAAG AGGCAGCTGG AGGAGGCAGA GGAGGAGTCC 
57 01 CAGCGCATCA ACGCCAACCG CAGGAAGCTG CAGCGGGAGC TGGATGAGGC 
5751 CACGGAGAGC AACGAGGCCA TGGGCCGCGA GGTGAACGCA CTCAAGAGCA 
5801 AGCTCAGGCG AGGAAACGAG ACCTCTTTCG TTCCTTCTAG AAGGTCTGGA 
5851 GGACGTAGAG TTATTGAAAA TGCAGATGGT TCTGAGGAGG AAACGGACAC 
5901 TCGAGACGCA GACTTCAATG GAACCAAGGC CAGTGAA 
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^3 601 KNMDPLNDNV TSLLNASSDK FVADLWKDVD RIVGLDQMAK MTESSLPSAS 

ru 

\j 651 KTKKGM FRTV GQLYKEQLGK LMTTLRNTTP NFVRCIIPNH EKRSGKLiDAF 

701 LVLEQLRCNG VLEGIRICRQ GFPNRIVFQE FRQRYEILAA NAI PKGFMDG 

N ?51 KQACILMIKA LELDPNLYRI GQSKIFFRTG VLAHLEEERD LKI TDVIMAF 

O 801 QAMCRGYLAR KAFAKRQQQL TAMKVIQRNC AAYLKLRNWQ WWRLFTKVKP 

M' 851 LLQVTRQEEE MQAKEDELQK TKERQQKAEN ELKELEQKHS QLTEEKNLLQ 

a 

p 901 EQLQAETELY AEAEEMRVRL AAKKQELEEI LHEMEARLEE EEDRGQQLQA 

951 ERKKMAQQML DLEEQLEEEE AARQKLQLEK VTAEAKI KKL EDEILVMDDQ 
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1151 LGEELEALKT ELEDTLDSTA TQQELRAKRE QEVTVLKKAL DEETRSHEAQ 

12 01 VQEMRQKHAQ AVEELTEQLE QFKRAKANLD KNKQTLEKEN ADLAGELRVL 

12 51 GQAKQEVEHK KKKLEAQVQE LQSKCSDGER ARAELNDKVH KLQNEVESVT 

13 01 GMLNEAEGKA IKLAKDVASL SSQLQDTQEL LQEETRQKLN VSTKLRQLEE 
1351 ERNSLQDQLD EEMEAKQNLE RHISTLNIQL SDSKKKLQDF ASTVEALEEG 

14 01 KKRFQKE IEN LTQQYEEKAA AYDKLEKTKN RLQQEL.DDLV VDLDNQRQLV 

14 51 SNLEKKQRKF DQLLAEEKNI SSKYADERDR AEAEAREKET KALSLARALE 

15 01 EALEAKEELE RTNKMLKAEM EDLVSSKDDV GKNVHELEKS KRALETQMEE 
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INSPVAQADW AAKRLVWVPS EKQGFEAAS I 
VGKDD I QKMN PPKFSKVEDM AELTCLNEAS 
FCVWNPYKH LPIYSEKIVD MYKGKKRHEM 
DQSILCTGES GAGKTENTKK V I QYLAWAS 
LEKQLLQANP I LEAFGNAKT VKNDNS SRFG 
YLLEKSRAIR QARDERTFHI FYYMIAGAKE 
FVPIPAAQDD EMFQETVEAM AIMGFSEEEQ 
ERNTDQASMP DNTAAQKVCH LMGINVTDFT 
KEQAD FAVE A LAKATYERLF RW I LTRVNKA 
EIFEVNSFEQ LCINYTNEKL QQLFNHTMFI 
DLQPCIELIE RPNNPPGVLA LLDEECWFPK 
QKPKQLKDKT EFSIIHYAGK VDYNASAWLT 
FVADLWKDVD RIVGLDQMAK MTESSLPSAS 
LMTTLRNTTP NFVRCIIPNH EKRSGKLDA 
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MAQKGQLSDD EKFLFVDKNF INSPVAQADW AAKRLVWVPS EKQGFEAASI 
KEEKGDEVW ELVENGKKVT VGKDD I QKMN PPKFSKVEDM AELTCLNEAS 
VLHNLiRERYF SGLIYTYSGL FCVWNPYKH LPIYSEKIVD MYKGKKRHEM 
PPHIYAIADT AYRSMLQDRE DQSILCTGES GAGKTENTKK VI QYLAWAS 
SHKGKKDTS I TQGPSFAYGE LEKQLLQANP I LEAFGNAKT VKNDNS SRFG 
KFIRINFDVT GYIVGANIET YLLEKSRAIR QARDERTFHI FYYMIAGAKE 
KMRSDLLLEG FNNYTFLSNG FVPIPAAQDD EMFQETVEAM AIMGFSEEEQ 
LSILKWSSV LQLGNIVFKK ERNTDQASMP DNTAAQKVCH LMGINVTDFT 
RSILTPRIKV GRDWQKAQT KEQAD F AVE A LAKATYERLF RW I LTRVNKA 
LDKTHRQGAS FLGILDIAGF EIFEVNSFEQ LCINYTNEKL QQLFNHTMFI 
LEQEEYQREG IEWNFIDFGL DLQPCIELIE RPNNPPGVLA LLDEECWFPK 
ATDKS FVEKLt CTEQGSHPKF QKPKQLKDKT EFSIIHYAGK VDYNASAWLT 
KNMDPLNDNV TSLLNASSDK FVADLWKDVD RIVGLDQMAK MTESSLPSAS 
KTKKGMFRTV GQLYKEQLGK LMTTLRNTTP NFVRCIIPNH EKRSGKLDAF 
LVLEQLRCNG VLEGIRICRQ GFPNRIVFQE FRQRYEILAA NAIPKGFMDG 
KQACILMIKA LELDPNLYRI GQ 
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CTTTGGGCTG GACCTACAGC CCTGCATCGA GCTCATCGAG CG AC CGAAC A 
ACCCTCCAGG TGTGCTGGCC CTGCTGGACG AGGAATGCTG GTTC CC C AAA 
GCCACGGACA AGTCTTTCGT GGAGAAGCTG TGCACGGAGC AGGGCAGCCA 
CCCCAAGTTC CAGAAGCCCA AGCAGCTCAA GGACAAGACT GAGTTCTCCA 
TCATCCATTA TGCTGGGAAG GTGGACTATA ATGCGAGTGC CTGGCTGACC 
AAGAATATGG ACCCGCTGAA TGACAACGTG ACTTCCCTGC TCAATGCCTC 
CTCCGACAAG TTTGTGGCCG ACCTGTGGAA GGACGTGGAC CGCATCGTGG 
GCCTGGACCA GATGGCCAAG ATGACGGAGA GCTCGCTGCC CAGCGCCTCC 
AAGACCAAGA AGGGCATGTT CCGCACAGTG GGGCAGCTGT ACAAGGAGCA 
GCTGGGCAAG CTGATGACCA CGCTACGCAA CACCACGCCC AACTTCGTGC 
GCTGCATCAT CCCCAACCAC GAGAAGAGGT CCGGCAAGCT GGATGCGTTC 
CTGGTGCTGG AGCAGCTGCG GTGCAATGGG GTGCTGGAAG GCATTCGCAT 
CTGCCGGCAG GGCTTCCCCA ACCGGATCGT CTTC CAGGAG TTCCGCCAAC 
GCTACGAGAT CCTGGCGGCG AATGCCATCC CCAAAGGCTT CATGGACGGG 
AAGCAGGCCT GCATTCTCAT GATCAAAGCC CTGGAACTTG ACCCCAACTT 
ATACAGGATA GGGCAGAGCA AAATCTTCTT CCGAACTGGC GTCCTGGCCC 
ACCTAGAGGA GGAGCGAGAT TTGAAGATCA CCGATGTCAT CATGGCCTTC 
CAGGCGATGT GTCGTGGCTA CTTGGCCAGA AAGGCTTTTG CCAAGAGGCA 
GCAGCAGCTG ACCGCCATGA AGGTGATTCA GAGGAACTGC GCCGCCTACC 
TCAAGCTGCG GAACTGGCAG TGGTGGAGGC TTTTCACCAA AGTGAAG 
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MAQKGQLSDD EKFLFVDKNF INSPVAQADW AAKRLVWVPS EKQGFEAASI 
KEEKGDEVW ELVENGKKVT VGKDD I QKMN PPKFS KVEDM AELTCLNEAS 
VLHNLRERYF SGLIYTYSGL FCVWNPYKH LPIYSEKIVD MYKGKKRHEM 
PPHIYAIADT AYRSMLQDRE DQSILCTGES GAGKTENTKK VIQYLAWAS 
SHKGKKDTS I TQGPSFAYGE LEKQLLQANP I LEAFGNAKT VKNDNSSRFG 
KFIRINFDVT GYIVGANIET YLLEKSRAIR QARDERTFHI FYYMIAGAKE 
KMRSDLLLEG FNNYTFLSNG FVP I PAAQDD EMFQETVEAM AIMGFSEEEQ 
LSILKWSSV LQLGNIVFKK ERNTDQASMP DNTAAQKVCH LMGINVTDFT 
RSILTPRIKV GRDWQKAQT KEQADFAVEA LAKATYERLF RWILTRVNKA 
LDKTHRQGAS FLGILDIAGF EIFEVNSFEQ LCINYTNEKL QQLFNHTMFI 
LEQEEYQREG IEWNFIDFGL DLQPCIELIE RPNNPPGVLA LLDEECWFPK 
ATDKS FVEKL CTEQGSHPKF QKPKQLKDKT EFSIIHYAGK VDYNASAWLT 
KNMD PLNDNV TSLLNASSDK FVADLWKDVD RIVGLDQMAK MTESSLPSAS 
KTKKGMFRTV GQLYKEQLGK LMTTLRNTTP NFVRCIIPNH EKRSGKLDAF 
LVLEQLRCNG VLEGIRICRQ GFPNRIVFQE FRQRYEILAA NAIPKGFMDG 
KQACILMIKA LELDPNLYRI GQSKIFFRTG VLAHLEEERD LKITDVIMAF 
QAMCRGYLAR KAFAKRQQQL TAMKVIQRNC AAYLKLRNWQ WWRLFTKVK 
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1551 CTTTGGGCTG GACCTACAGC CCTGCATCGA GCTCATCGAG CGAC CGAAC A 

16 01 ACCCTCCAGG TGTGCTGGCC CTGCTGGACG AGGAATGCTG GTTCCCCAAA 
1651 GCCACGGACA AGTCTTTCGT GGAGAAGCTG TGCACGGAGC AGGGCAGCCA 

17 01 CCCCAAGTTC CAGAAGCCCA AGCAGCTCAA GGACAAGACT GAGTTCTCCA 
17 51 TCATCCATTA TGCTGGGAAG GTGGACTATA ATGCGAGTGC CTGGCTGACC 
1801 AAGAATATGG ACCCGCTGAA TGACAACGTG ACTTCCCTGC TCAATGCCTC 
1851 CTCCGACAAG TTTGTGGCCG AC CTGTGGAA GGACGTGGAC CGCATCGTGG 
1901 GCCTGGACCA GATGGC CAAG ATGACGGAGA GCTCGCTGCC CAGCGCCTCC 
1951 AAGACCAAGA AGGGCATGTT CCGCACAGTG GGGCAGCTGT ACAAGGAGCA 
2 001 GCTGGGCAAG CTGATGACCA CGCTACGCAA CACCACGCCC AACTTCGTGC 
2 051 GCTGCATCAT CCCCAACCAC GAGAAGAGGT CCGGCAAGCT GGATGCGTTC 
2101 CTGGTGCTGG AGCAGCTGCG GTGCAATGGG GTGCTGGAAG GCATTCGCAT 
2151 CTGCCGGCAG GGCTTCCCCA ACCGGATCGT CTTCCAGGAG TTCCGCCAAC 
22 01 GCTACGAGAT CCTGGCGGCG AATGCCATCC CCAAAGGCTT CATGGACGGG 

Ul 22 51 AAGCAGGCCT GCATTCTCAT GATCAAAGCC CTGGAACTTG ACCCCAACTT 

N 23 01 ATACAGGATA GGGCAGAGCA AAATCTTCTT CCGAACTGGC GTCCTGGCCC 

Q 2 3 51 ACCTAGAGGA GGAGCGAGAT TTGAAGATCA CCGATGTCAT CATGGCCTTC 

03 

^ 24 01 CAGGCGATGT GTCGTGGCTA CTTGGCCAGA AAGGCTTTTG CCAAGAGGCA 

^3 24 51 GCAGCAGCTG ACCGCCATGA AGGTGATTCA GAGGAACTGC GCCGCCTACC 

^ 2 501 TCAAGCTGCG GAACTGGCAG TGGTGGAGGC TTTTCACCAA AGTGAAGCCA 

2 551 CTGCTG 
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1 MAQKGQLSDD EKFLFVDKNF INSPVAQADW AAKRLVWVPS EKQGFEAASI 

51 KEEKGDEVW ELVENGKKVT VGKDD I QKMN PPKFSKVEDM AELTCLNEAS 

101 VLHNLRERYF SGLIYTYSGL FCVWNPYKH LPIYSEKIVD MYKGKKRHEM 

151 PPHIYAIADT AYRSMLQDRE DQSILCTGES GAGKTENTKK V I QYLAWAS 

201 SHKGKKDTS I TQGPSFAYGE LEKQLLQANP I LEAFGNAKT VKMDNSSRFG 

2 51 KFIRINFDVT GYIVGANIET YLLEKSRAIR QARDERTFH I FYYMIAGAKE 

3 01 KMRSDLLLEG FNNYTFLSNG FVPIPAAQDD EMFQETVEAM AIMGFSEEEQ 

3 51 LSILKWSSV LQLGNIVFKK ERNTDQASMP DNTAAQKVCH LMGINVTDFT 

4 01 RSILTPRIKV GRDWQ KAQT KE Q AD F AVE A LAKATYERLF RWILTRVNKA 
451 LDKTHRQGAS FLGILDIAGF EIFEVNSFEQ LCINYTNEKL QQLFNHTMFI 
501 LEQEEYQREG IEWNFIDFGL DLQPCIELIE RPNNPPGVLA LLDEECWFPK 
551 ATDKS FVEKL CTEQGSHPKF QKPKQLKDKT EFSIIHYAGK VDYNASAWLT 
601 KNMDPLNDNV TSLLNASSDK FVADLWKDVD RIVGLDQMAK MTESSLPSAS 
651 KTKKGMFRTV GQLYKEQLGK LMTTLRNTTP NFVRCIIPNH EKRSGKLDAF 

Ul 7 01 LVLEQLRCNG VLEGIRICRQ GFPNRIVFQE FRQRYE I LAA NAI PKGFMDG 

S$ 751 KQACILMIKA LELDPNL YR I GQSKIFFRTG VLAHLEEERD LKITDVIMAF 



4% 
■05 

ru 



n 

Q 



801 QAMCRGYLAR KAFAKRQQQL TAMKVI QRNC AAYLKLRNWQ WWRLFTKVKP 
851 LL 
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1 ATGGCGCAGA AGGGCCAACT CAGTGACGAT GAGAAGTTCC TCTTTGTGGA 

51 CAAAAACTTC ATCAACAGCC CAGTGGCCCA GGCTGACTGG GCCGCCAAGA 

101 GACTCGTCTG GGTCCCCTCG GAGAAGCAGG GCTTCGAGGC AGCCAGCATT 

151 AAGGAGGAGA AGGGGGATGA GGTGGTTGTG GAGCTGGTGG AGAATGGCAA 

2 01 GAAGGTCACG GTTGGGAAAG ATG AC AT C C A GAAGATGAAC CCACCCAAGT 
251 TCTCCAAGGT GGAGGACATG GCGGAGCTGA CGTGCCTCAA CGAAGCCTCC 

3 01 GTGCTACACA ACCTGAGGGA GCGGTACTTC TCAGGGCTAA TATATACGTA 
351 CTCTGGCCTC TTCTGCGTGG TGGTCAACCC CTATAAACAC CTGCCCATCT 

4 01 ACTCGGAGAA GATCGTCGAC ATGTACAAGG GCAAGAAGAG GCACGAGATG 
451 CCGCCTCACA TCTACGCCAT CGCAGACACG GCCTACCGGA GCATGCTTCA 
501 AGATCGGGAG GACCAGTCCA TTCTATGCAC AGGCGAGTCT GGAGCCGGGA 
551 AAACCGAAAA CACCAAGAAG GTCATTCAGT ACCTGGCCGT GGTGGCCTCC 
601 TCCCACAAGG GCAAGAAAGA CACAAGTATC ACGGGAGAGC TGGAAAAGCA 



M 651 GCTTCTACAA GCAAACCCGA TTCTGGAGGC TTTCGGCAAC GCCAAAACAG 

m 

€1 7 01 TGAAGAACGA CAACTCCTCA CGATTCGGCA AATTCATCCG CATCAACTTC 



751 GACGTCACGG GTTACATCGT GGGAGCCAAC ATTGAGACCT ATCTGCTAGA 

801 AAAATCACGG GCAATTCGCC AAGCCAGAGA CGAGAGGACA TTCCACATCT 

851 TTTACTACAT GATTGCTGGA GCCAAGGAGA AGATGAGAAG TGACTTGCTT 

901 TTGGAGGGCT TCAACAACTA CACCTTCCTC TCCAATGGCT TTGTGCCCAT 

951 CCCAGCAGCC CAGGATGATG AGATGTTCCA GGAAACCGTG GAGGCCATGG 

10 01 CAATCATGGG TTTCAGCGAG GAGGAGCAGC TATCCATATT GAAGGTGGTA 

10 51 TCATCGGTCC TGCAGCTTGG AAATATCGTC TTCAAGAAGG AAAGAAACAC 

1101 AG AC C AGGCG TCCATGCCAG ATAACACAGC TGCTCAGAAA GTTTGCCACC 

1151 TCATGGGAAT TAATGTGACA GATTTCACCA GATCCATCCT CACTCCTCGT 

12 01 ATCAAGGTTG GGCGAGATGT GGTACAGAAA GCTCAGACAA AAGAACAGGC 

12 51 TGACTTTGCT GTAGAGGCTT TGGCCAAGGC AACATATGAG CGCCTTTTCC 

13 01 GCTGGATACT CACCCGCGTG AACAAAGCCC TGGACAAGAC CCATCGGCAA 

13 51 GGGGCTTCCT TCCTGGGGAT CCTGGATATA GCTGGATTTG AGATCTTTGA 

14 01 GGTGAACTCC TTCGAGCAGC TGTGCATCAA CTACACCAAC GAGAAGCTGC 

14 51 AGCAGCTCTT CAACCACACC ATGTTCATCC TGGAGCAGGA GGAGTACCAG 

15 01 CGCGAGGGCA TCGAGTGGAA CTTCATCGAC TTTGGGCTGG ACCTACAGCC 
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15 51 CTGCATCGAG CTCATCGAGC GACCGAACAA CCCTCCAGGT GTGCTGGCCC 
16 01 TGCTGGACGA GGAATGCTGG TTCCCCAAAG CCACGGACAA GTCTTTCGTG 

16 51 GAGAAGCTGT gcacggagca gggcagccac cccaagttcc agaagcccaa 
17 01 gcagctcaag gacaagactg agttctccat catccattat gctgggaagg 

175 1 TGGACTATAA TGCGAGTGC C TGGCTGACCA AGAATATGGA CCCGCTGAAT 
18 01 GACAACGTGA CTTCCCTGCT CAATGCCTCC TCCGACAAGT TTGTGGCCGA 
XSSI CCTGTGGAAG GACGTGGACC GCATCGTGGG CCTGGACCAG ATGGC CAAGA 
190 1 TGACGGAGAG CTCGCTGCCC AGCGCCTCCA AGACCAAGAA GGGCATGTTC 
1951 CGCACAGTGG GGCAGCTGTA CAAGGAGCAG CTGGGCAAGC TGATGACCAC 
20 0X GCTACGCAAC ACCACGCCCA ACTTCGTGCG CTGCATCATC CCCAACCACG 
20 51 AGAAGAGGTC CGGCAAGCTG GATGCGTTCC TGGTGCTGGA GCAGCTGCGG 
210 1 TGCAATGGGG TGCTGGAAGG CATTCGCATC TGCCGGCAGG GCTTCCCCAA 
2151 CCGGATCGTC TTCCAGGAGT TCCGCCAACG CTACGAGATC CTGGCGGCGA 
22 01 ATGCCATCCC CAAAGGCTTC ATGGACGGGA AGCAGGCCTG CATTCTCATG 
2251 ATCAAAGCCC TGGAACTTGA CCCCAACTTA TACAGGATAG GGCAGAGCAA 
23 0X AATCTTCTTC CGAACTGGCG TCCTGGCCCA CCTAGAGGAG GAGCGAGATT 
23 5X TGAAGATCAC CGATGTCATC ATGGCCTTCC AGGCGATGTG TCGTGGCTAC 
240 1 TTGGCCAGAA AGGCTTTTGC CAAGAGGCAG CAGCAGCTGA CCGCCATGAA 
245 1 GGTGATTCAG AGGAACTGCG CCGCCTACCT CAAGCTGCGG AACTGGCAGT 
2501 GGTGGAGGCT TTTCACCAAA GTGAAGCCAC TGCTG 
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651 KTKKGMFRTV GQXiYKEQIiGK LMTTLRNTTP HPVRCXXPHH EKRSGKLDA 

FIG. 14 



